A GEOMETRIC APPROACH TO THE CASCADE 
APPROXIMATION OPERATOR FOR WAVELETSQ 
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This paper is devoted to an approximation problem for operators in Hilbert space, 
that appears when one tries to study geometrically the cascade algorithm in 
wavelet theory Let 7i be a Hilbert space, and let % be a representation of L°° (T) 
on 7i. Let R be a positive operator in L°° (T) such that = 1, where U 

denotes the constant function 1. We study operators M on TC (bounded, but non- 
contractive) such that 

n(f)M = M7r(f(z 2 )) and M*tt (/) M = vr (R*f) , / 6 L°° (T) , 

where the * refers to Hilbert space adjoint. We give a complete orthogonal ex- 
pansion of 7i which reduces n such that M acts as a shift on one part, and the 
residual part is = H [M n 7i] , where [M n 7i] is the closure of the range of 

M n . The shift part is present, we show, if and only if ker (M*) 7^ {0}. We apply 
the operator-theoretic results to the refinement operator (or cascade algorithm) 
from wavelet theory. Using the representation tc, we show that, for this wavelet 
operator M, the components in the decomposition are unitarily, and canonically, 
equivalent to spaces L 2 (E n ) C L 2 (R), where E n C R, n = 0, 1, 2, . . . , 00, are 
measurable subsets which form a tiling of R; i.e., the union is R up to zero mea- 
sure, and pairwise intersections of different E n 's have measure zero. We prove two 
results on the convergence of the cascade algorithm, and identify singular vectors 
for the starting point of the algorithm. 
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Terminology used in the paper: 



T 


the one-torus 


/' 


Haar measure on the torus T 


Z 


the Zak transform 


zxz- 1 


transformation of operators 


n 


a given Hilbert space 


7T 


a representation of L°° (T) on 7i 


R 


the Ruelle operator on L°° (T) 


M 


an operator on TC 


R*,M* 


adjoint operators 



1. INTRODUCTION 

In wavelet theory, one is given a subband filter, i.e., a function uiq on the unit circle, 
satisfying (^-([I]) from below and one wants to construct a scaling function ip (relative to 
mo), i.e., a nonzero function (p on R satisfying the scaling relation <p = Mtp (relation (|1 . 1| ) 
in the paper). Here M = M mo is the so-called cascade operator defined by 



(Mh) (x) = V2^a n h{2x 



n) 



where a n are Fourier coefficients of the function m . The scaling function ip is important 
because its shifts generate (under some analytic conditions) the so-called multiresolution 
subspace V = V (cp), which is used to construct the wavelets. 

There are several ways of constructing a scaling function ip. The cascade algorithm 
is one of the possibilities. In this algorithm one picks some simple function h and considers 
its iterations M n h. Clearly, if the iterations M n h converge to a nonzero function (p, the 
function ip satisfies <p = Mcp, so the algorithm gives us a scaling function. We study the 
problem of convergence of this algorithm in the setting of an abstract Hilbert space. The 
cascade operator M has some very special structure. The Ruelle operator R, which appears 
naturally in this type of problem, gives us a way to describe this structure. Namely, the 
operator M is what we shall call a sub-isometry; see Definition IOI. In fact the concept 



depends on a pair (R, n) where R is a Ruelle operator and tc is a representation. It turns 
out that sub-isometries admit an analogue of the Kolmogorov-Wold decomposition for usual 
isometries. Using this decomposition, we obtain results about convergence of the cascade 
algorithm in an abstract Hilbert space setting, and then, in the last section, give some 
applications for wavelet theory. 

This paper is technically concerned with approximation problems for operators in 
Hilbert space, but our initial motivation is the refinement operator (alias the cascade approx- 
imation operator) from wavelet theory. Our starting point is a given function m G L°° (T) , 
which is assumed to satisfy the following three axioms: 



(i) mo is continuous in an open neighborhood of z — 1 in T, 

(ii) m = V2 at z — 1, 

(iii) |m (^)| 2 + |m (-^)| 2 = 2. 

In applications (see ||Dau92|| ), mo is called a subband filter, and ([n]) is called the low pass 
filter property. That is because of the substitution z = e~ luJ , uj frequency; and (|n]) implies 
that all signals pass at u — 0, while (|n|) and (|) imply that "almost all" pass for small u, i.e., 
low frequencies. Finally, axiom (|m|) accounts for the name quadrature filter, or "quadrature 
mirror filter". They are used in wavelet theory for generating multiresolutions. Suppose 
mo (z) = Yl n & a nZ n is the Fourier expansion of mo- A function (p G L 2 (R) is called a scaling 
function (relative to mo) if 

(1.1) <p (x) = V2 a n (p (2x - n) , 
and we refer to the operator 

(1.2) (Mh) (x) = V2j2a n h(2x-n) 

as the cascade operator. The approximation problem is that of finding conditions on the 
three functions m , <p, and h (h G L 2 (R)), such that 

(1.3) lim ||^-M^|| L2(R) = 0. 

n— >oo v ' 

This is called the cascade approximation, or the cascade problem. Let R be the Ruelle 
operator (Section ^ corresponding to mo- The basic connection between the two operators 
R and M will be studied in a geometric Hilbert-space setting (Section [|), where we give a 
quadratic form h 1— > £>2 (^) on L 2 (R) with values in L 1 (T). We then introduce the concept 
of sub-isometries. (The cascade operators M are examples.) We show that R(p2{h)) = 
Pi (Mh), so if Mp = ip, then p 2 (<p) is an eigenfunction for R. In addition to L 2 -convergence 
questions, there are various classes of pointwise convergence issues. It turns out that, when 
mo is given, satisfying (0)-([m]), and ip, h satisfy natural criteria, then there is an unexpected 
obstruction to the approximation ( |1.3|) . If ker(M*) 7^ {0} in L 2 (R), then the cascade 



approximation ( |1.3|) is exceedingly "bad". We show (Proposition |8.2j ) that ker(M*) = {0} 
if and only if mo does not vanish on a set of positive measure C T. So if, for example, m 
is a polynomial, then ker (M*) = {0}. 



2. SOME MAIN RESULTS IN THE PAPER 

When ker (M*) is not an obstruction, we show the following result. We say that 
/ G L 2 (R) is orthogonal if the translates {/ ( • — n) \ n G Z} form an orthonormal family in 
L 2 (R). Let ip be a scaling function relative to some mo satisfying (fl)- ([ul|) , and let h G L 2 (R). 
Suppose both <p and h are orthogonal in this sense. Then we show that the following two 
conditions are equivalent: 



(a) \\(p - M n h\\ 2 - — ► 0, and 

(b) the function 



n^oo 



p(t) = J2 eint I y{x + n)h (x) dx 



is continuous near t — 0, and p(0) = 1. 

The result fails when ker (M*) 7^ {0}; see Section || 

In studying this, and other related approximation problems in earlier papers ||BrJo97| , 
BrJo98|1 , the following general and geometric Hilbert-space framework proved useful. Let 
be given subject to (|)-(|ujD- (For much of it, only (juD is needed.) We introduce the Ruelle 
operator 



(T) 



(2-1) (i^)(^) = ^KH| 2 CH, £gL c 

w 2 =z 

and its dual, 

(JT0 (z) = \m (z)\ 2 £(z 2 ). 

Let Tt be an abstract Hilbert space, and let f be a representation of L°° (T) on TL. An 
operator M in Ti is said to be an (R, Tc)-isometry if 

(2.2) M*vr (0 M = tt (iT (0) for all £ G L°° (T) . 

We also call M a sub-isometry when (_R, 7r) is understood. The cascade operators are special 
cases of (R, 7r)-isometries (see Section ||), and several of our results for the cascade oper- 
ator will be derived from a general result about (R, 7r)-isometries; see Section §. We now 
summarize briefly our main result for (R, 7r)-isometries. Let M be an (R, 7r)-isometry on 
some Hilbert space H, and let C := ker(M*), and H {co) := |X=i where [•] refers 

to norm-closure in TC. We then establish (Theorem |6.2|) the following general orthogonal 
decomposition, 



(2.3) n = J] e [M n c] © n 



(00) 



n=0 



and, moreover, we show that each of the (mutually orthogonal) component spaces [M n £] 
and H.(°°^ is invariant under 7r(£) for all £ G L°° (T). In other words, the spaces in the 
decomposition reduce the representation 7r; if the restricted representations are denoted 
ii n , we have n = ir © %i © • • • © tToo. (See |Pix96|| for the theory of decompositions of 



representations.) The significance of having the terms TC^ in the sum (|2.3j ) invariant under 
the representation tt is that the structure of the subspaces Ti.^ themselves may then be 
determined from the representation. We show that generically thesequence 7i n = 7r| W (n) is 



determined from ttq, and the latter can be computed from the spectral theorem. We also show 
that every representation of L°° (T) occurs as n in some decomposition ( |2.3| ) corresponding 
to an (R, 7r)-isometry. 

While the operator M plays a central role in the wavelet literature (see, e.g., 



UoRy95|| , |Pau92|| ), the approach (f2.2|)-(f2~3|) here is new. It is motivated by 



the need for a representation-theoretic approach to the classification of wavelets [ [Br Jo97|| , 
and also by the need for including in the analysis other Hilbert spaces 7i than just L 2 



[ Jor98 



see 

Even if L 2 (M) is the final goal, there is a need for understanding the limiting cases 
when some different Hilbert space 7i (other than L 2 (IR)) is dictated by some more general 
or different framework and analysis; we refer to ||Jor98|l for more details on this viewpoint. 
In ||Jor98|1 , a framework is adopted which is much more general than the present setting 
of quadrature mirror filters. (It includes, for example, the orthogonal harmonic analysis 
of 7i = L 2 (/i) from [|JoPe98|l where /i is taken to be a self-affine and singular probability 
measure on IR d of compact support which arises by iteration of a given finite set of affine 
mappings in IR d .) Our present approach to multiresolution analysis is inspired by the Lax- 
Phillips scattering theory |[LaPh89|l for the classical wave equation. This is the continuous 
case; our aim here is to "discretize" the Lax-Phillips scattering theory, and fl2.2|) should be 
viewed in that light. The recent papers by Micchelli et al. | MiPr89|| , |[Mic96 



take 

a somewhat different (but closely related) approach to the discretization problem: Since the 
functions in a multiresolution subspace V (<p) may be represented by sequences (£n)n S z; via 




( ' — n ) ' the operator M may therefore be studied alternatively as acting on one of 
the sequence spaces £ p (Z). In this guise, it takes the form M — > £ = M, where 



(2.4) 



fcez 



The connection to the sequence spaces is taken up here in Section |] below; see especially 
Theorem |4.7| and Table [1| The operator on L 2 (T) which corresponds to ( |2.4j) is simply 
/ i— > m (z) f (z 2 ) when m (z) = Xlnez a n zn is the Fourier series of some given filter function 
m. 

The motivation for the present paper has several sources. In ||BEJ97|| , we showed 
that certain wavelets, for which the corresponding quadrature mirror filter m is a polyno- 
mial, may be classified by the labels of a corresponding family of irreducible representations 
of the Cuntz algebra 0<i- Previously there were known no such clear-cut invariants that clas- 
sified wavelet families. But the particular wavelets from [BEJ97| had in fact been identified 
earlier (without invariants or classification) in |[Wel93|l and ||Wic93|l . Regularity questions for 
the corresponding scaling functions had been studied in |DDL95|1 , [ GM W94jj , and [|MRV96|1 . 
Our references for quadrature mirror filters are [|5tNg96|| and ||Mal98 



Excellent references 



to wavelets from the operator-theoretic viewpoint are ||HcWe96| and |[Hor95|| . The refine- 
ment operators of ( |2.4|) are also called slant Toeplitz operators, and their spectral theory was 
studied previously in ||Ho96|| . A standard reference to the ergodic theory which we use in 



the present paper is [|Wal82 . 



3. BACKGROUND, SUMMARY, AND OPERATOR RELATIONS 

Let T denote the one- dimensional torus and let /i be its (usual) Haar measure. We 
will use the form T ~ IR/27rZ such that functions on T are identified with 27r-per iodic 
functions on R. For technical reasons, the identification will be made via z = e~ luJ , w 6 K, 
and functions on T will be written, alternately, as / (z) or / (a>). Let m G L°° (T) be given, 
and suppose m = a/2 at z = 1, and further that 

(3.i) K02)| 2 + K(-z)| 2 = 2. 

One approach to wavelets (see [Pau92|| ) is to first make precise the (formal) infinite product 
(limit asn-» oo): 

n 1 OJ 

( 3 - 2 ) II 7f m ° (^) X[-2n Wl 2^] M • 

k=l 

Suppose this product ( |3.2| ), in the limit, represents a function F G L 2 (R). Then 
(3.3) V2F (2a;) = m (u) 

and we wish to recover the scaling function ip as the inverse Fourier transform of F. The 
approach is referred to as the M allot algorithm |[Mal89|| , but it is somewhat indirect. Let 



(3.4) (f (x) = — / e luJX F (u) duo 



>3C 



OO 



be the inverse Fourier transform, <p = F , and let 
(3.5) m (z) = 



a n z n 



be the Fourier series expansion of m . (The Fourier basis in Q3.5|) is e n (z) = z n .) Then 
(again formally), 



(3.6) <f (x) = \f2 s y^ j a n Lp (2x — 



n) 



The function ip is called the scaling function, and the closed linear span of the translates 

(3.7) {</?(• -n) | n G Z} 

generates, under certain analytic conditions [ Dau92 |, a multiresolution subspace V = V (p) 
in L 2 (R). The identity ( p.6| ) allows, for each mo, an operator M = M mo in L 2 (M) such that 
scaling functions <p arise as solutions to a fixed point problem. If we introduce the operator 
M by 

(3.8) (Mh) (x) = V2^2a n h(2x-n), he L 2 (R) , 
then 



(3.9) 



(p = Mtp, 



and we should look for choices of h such that 

(3.10) lim M n h = tp. 



We will be interested here primarily in this as an L 2 (IR)-limit, but other limits (e.g., point- 
wise) will be considered as well. The issue is both how preassigned properties of the starting 
function h, and the scaling function ip, affect the approximation properties of ( |3.10| ). We 
refer to this as the cascade approximation, and the traditional choice for h is h = Xroip 
which accidentally is the scaling function for the Haar wavelet. The cascade algorithm is 
more direct than the Mallat one, as it doesn't pass via the Fourier transform. 

The right-hand side in (|3.S| ) involves a dyadic scaling, and an action by L°° (T) on 
L 2 (R). We now introduce a representation a i— > 7r (a) of L°° (T) in the algebra of operators 
on L 2 (R) by 

(3.11) (tt (a) h) (x) = a n h (x — n) . 

n&L 

Then M can be represented as M = D7r(mo), where D is the operator of dyadic scaling, 
Dh(x) = \/2h(2x). Application of the Fourier transform ^ to both sides in (|3.11| ) then 
yields 

(3.12) h\ — >a((j)h(u) 
via the identification 

a(u)~a(z), z = e~ iuJ , wGl 



(In applications, a; is a frequency variable.) In either of its forms, Q3.11|) or Q3.12j ), this 
representation of L°° (T) will be denoted tt (a) h with n (a) an operator acting on L 2 (R), 
and 7r (L°° (T)) the corresponding algebra of operators. 

If mo is given, then L°° (T) carries two operators R and R*, R defined by 

(3.13) (Ra)(z) = ^J2 \m H\ 2 a(w) 

w 2 =z 

and the adjoint operator R* by 

(3.14) {R*P) (z) = \m (z) | 2 (3 (z 2 ) , a, (3 e (T) . 

Both of them will also be viewed as L 2 (T)-operators, and we have, by a simple calculation, 

(3.15) / (Ra) {z) (5 (z) dfx (z) = [a (z) R* (3 (z) dpi {z) , 

where dfx is the usual Haar measure on T, i.e., ^ • • • du, thus justifying the notation R*. 



The operator R is called the Ruelle operator, or the transfer operator, for reasons we shall 
go into later; see also [|CoRy95|| . 



We further have the usual pairing between M in (|£gD , and its L 2 (R)-adjoint M* . 

given by 



/ 

Jr 



Mhi (x)fi2 (x) dx = J hi (x)M*h 2 (x) dx, 

Jm 

or equivalently, 

(3.16) (Mhi\h 2 ) = (h 1 \M*h 2 ), 

where ( • | • ) denotes the standard inner product of L 2 (R). 
Our first result is 

THEOREM 3.1. Let m e L°° (T) be given, and suppose it satisfies Q). Let M 
be the corresponding cascade operator, and R the Ruelle operator. The respective adjoints 
are M* and R* . Finally, let ir be the representation of L°° (T) on L 2 (R) given in ( p. 12 ). 
Then we have the following two commutation relations: 

(a) M*tt (a) M = n (R*a) and 

(b) Mn(a)M* = it (Rat) + Tr{R(e 1 a))Tx for all a G L°°(T), w/iere e x (z) = z and 
^Ti/i) (x) = h(x + \), h G L 2 (R). 

The proof will be given in Section [| below. In this paper, we will study the cascade 
approximation, and the scaling function tp, via the abstract algebraic system which is given 
by the relations of the theorem. These two operator commutation relations will first 

be studied abstractly (Section |) and independently of their origin, and then the results 
will be specialized, and applied, to the wavelet problems mentioned above. Our proofs will 
depend on some lemmas of a general nature regarding the Zak transform. 



4. THE ZAK TRANSFORM 

The Zak transform Z is known |Dau92] , p. 109] to be the isometric isomorphism 
between L 2 (R) and L 2 (T x [0, 1]) which is given formally by 



(4.1) 



(Zh) (z, x) = z n h (x + n) , he L 2 (R) , x G R. 



neZ 



It is studied in [ pau92| , p. 109], | |BJR97| , and elsewhere. To make it precise, it is convenient 
to identify its range with functions H on T x R which satisfy the following scaling rule: 

(4.2) H (z,x + n) = z~ n H (z, x) for all z G T, x G R, n G Z. 

The norm \\H\\, or ||i?|| 2 , will be given by 



(4.3) \\H\\ 2 = / \H(z,x)\ 2 dfj,{z) dx, 

J Jo 



and it can be checked, see |Pau92| , p. 109], that 



(4.4) \\H\\ 2 = \\h\\ 2 

\h (x)\ 2 dx, 

and that H = Zh is an isometric isomorphism of L 2 (R) onto the Hilbert space Tiz of 
functions which are defined by the scaling formula ( |4.2|) , and completion in the norm || • || 2 of 



( |4.3| ). The simplest wavelet scaling function is (fn '■= Xroi] °f the Haar wavelet. (The Haar 
wavelet itself is generated by ipn = X\ ii — X\i y) The Zak transform Z has the pleasant 

feature that Z {^h) is the constant function II in L 2 (T x [0, 1]). 

A second advantage of the Tiz formulation ( ^4.2| ) is that it makes clear a useful direct 
integral representation 



(4.5) Hz = / H {£) dfi (z) , 

where each Ti (z) is the Hilbert space {h z } of functions h z on E satisfying the ^-scaling rule, 

h z (x + n) = z~ n h z (x) , for all x G M, n G Z. 



We refer to | Dix69|| for details on direct integrals, and note that each 7i (z) is naturally and 
isometrically isomorphic to L 2 (0, 1). Thus Z is a transform representing L 2 (IR)-functions as 
direct integrals (over T with Haar measure) of copies of L 2 (0, 1). 

The following lemmas about the Zak transform are new. They are needed in the 
sequel, and are also of independent interest, we feel. 



LEMMA 4.1. Let m G L°° (T) be given satisfying |U]), and let M and R denote 
the corresponding cascade operator and Ruelle operator. Let hi, i = 1,2, be functions in 
L 2 (R) with Zak transforms Hi = Zhi. Then 

(4.6) R ((#! (z) | H 2 (z)}) (z) = (Z z Mh | Z z Mh 2 ) , 
where the inner product ( ■ \ ■ ) on both sides of (fO| ) is that ofH (z), i.e., 

(4.7) (#! (z) | H 2 (z)) = [ H^x)^ (z, x) dx, 

Jo 

and 

(4.8) R((H 1 (-)\H 2 (.)))(z) = lj2 l m o (^)l 2 ^ I H * H) • 



„2- 



and the summation in (|4.8| ) is over the two roots, w G {i^/^} C T. 



REMARK 4.2. The formula (|4.6| ) is crucial for the use of R as a transfer operator 
in the iteration of the cascade algorithm on a given starting vector h G L 2 (R), i.e., h — > 
Mh — > M 2 h Secondly, we will show in Section that (|4.6|) is equivalent to the first 



of the two commutation relations, (W), in Theorem 3.1 



Proof. Let the functions be as described in the lemma, Hi = Zhi, i = 1,2. We first calculate 
the term Z z Mh\ from the desired equation (|4.6|) . Keep in mind that all integrals and sum- 
mations are convergent relative to the respective Hilbert-space norms (due to the isometries 
which we described before the statement of the lemma). Details are in |Pau92| , p. 109] and 



BJR97 



Let 

(4.9) (Uhi) (x) := hi (| 

Then Mhi = U^ 1 ^ (mo) hi, and we now calculate the Zak transform of the two individual 
operators making up M. First, hi i— >• 7r (mo) hi transforms into Hi \—> tuq (z) Hi (z, ■), since 

Z (n (m ) hi) = z n (tt (m ) hi) (x + n) 

n 

= z n a k hi (x + n — k) 

n k 

= a k zk (x + n-k) 

k n 

= 2J a>kZ k 2J ^"^1 ( x + n ) 

k n 

= m (z) Hi (z, x) 

as claimed. Note that all summation indices range over n, k G Z. We have £ 2 (Z)-convergent 
summations, i.e., relative to the respective ^ 2 -norms, and the exchange of the summations 
is justified by the norm-isomorphism property of the Zak transform. 

We now turn to the operator U~ given by U^ 1 = ZU^Z*, where Z* is the adjoint 
of Z. Since Z is a norm-isomorphism, we have 

ZZ* = id L 2 (Tx [ 0)1 ]) , and Z*Z = id L 2( R ) , 

where id refers to the respective identity operators. 
We claim that 

(4.10) (Z*Hi) (x) = [ Hi (z,x) dfi(z) . 

J 



The proof is the following calculation, for h G L 2 (R): 



(Hi\Zh) 




Hi (z, x)Zh (z, x) dfj, (z) dx 



T JO 



Hi (z, x) z n h (x + n) d\i (z) dx 

JtJo 

/ / y Hi (z, x + n)h (x + n) dx dji (z) 
JtJo „ 




Hi (z, x)h (x) dx dfi (z) 




Hi (z, x) dfj, (z) h (x) dx 



'ffi JT 

(Z*Hi | h) 



which proves the stated formula ( 4.1C ) for Z* . 
We now calculate 



U^HA (z, x) = (ZU^Z*) Hi (z, x) 

= {U^Z^Hi) (x + n) 

n 

= ^VV2(Z*/f 1 )(2x + 2n) 

n 

= J2z n V2 [ Hi((,2x + 2n) d//(C) 
= V2J2z n f C 2n Hi(C,2x) dfi(() 

JT 

n 

= V2J2 z n I c n \ £ H i & 2x ) d » (o 

^^=J2 H x(w,2x), 

w 2 =z 

where the last step simply represents the Fourier series of the final function. The formula 
(4.11) U^Hi^x) = Hi(w,2x) 



is basic in later proofs. Combining the three formulas ( |4.8| ), ([4.10 ), and (|4.11 ), we arrive at 

(4.12) (Z z Mhi) (x) = 4= Yl m o M Hi (w, 2x) . 

v2 



This is needed in the calculation of the right-hand side in (4.6) from the lemma as follows: 



(4.13) 



(Z z Mh! | Z z Mh 2 ) 



(Z z Mhi) (x) (Z z Mh 2 ) (x) dx 
I /-i 

- / y] m Q (wi) Hi (wi, 2x) 22 m o ( w z) H 2 (w 2 , 2x) dx 

W^=Z w 2 =z 

I /■! 

2^Z^2 m ° (^i) m o (w 2 ) / Hi (wi, 2x)H 2 (w 2 , 2x) dx 

...i ...i . Jo 



wf=wi=z 



7 5Z5Z m ° M m o M 



wi=wi.=z 



Hi (wi, x)H 2 (w 2 , x) + Hi(wi,x + l)H 2 (w 2 , x+1) 
7 5Z5Z m ° (^i) m o M 



dx 



wi=wZ=z 



dx 



Hi (wi, x)H 2 (w 2 , x) + w 1 w 2 1 Hi (wi, x)H 2 (w 2) x) 

- y^y^ m (wi)m (w 2 ) (l + WiW^) / H x (wt, x)H 2 (w 2 , x) dx. 
^00 Jo 



Wt=Wn=Z 



But if wi 7^ w 2 in the summation, then the factor (l + WiW^ 1 ) = 0. If Wi ^ w 2 , then 
w 2 = —Wi and u^u 1 ^ 1 = — 1. If Wi — w 2 , then (l + WiW^ 1 ) = 2. Continuing the calculation, 

we get 

(Z z Mhi I Z z Mh 2 ) = - V I mo (w) | 2 / Hi (to, x)ff 2 (w, x) 

= ^El m oHI 2 (^iHI^H) 

«J 2 =Z 

= ja«^ 1 (.)|fl a (-)»W, 

which is the final conclusion of the lemma. □ 

Having formula (|4.11 ) for U^ 1 , we may derive the corresponding formula for U = 
ZUZ*, but we isolate it here in a separate lemma. 

LEMMA 4.3. Let U be the scaling operator fl4j) on L 2 (R), Uh (x) = (f), and 

Zei J7 = ZUZ* be the corresponding operator on Tiz — L 2 (T x [0, 1]) 0/ i/ie Za/c transform. 
Then 



(4.14) 



Proof. 

ZUZ*H(z,x) 

= ^z n (UZ*H) (x + n) 



v n x ' 

75 E »" // (c f + *) * (0 + ^ E / * (c, ^ + *) *. (o 



x + 1 



1 ^r/,?u^r, 2 ,^ 



<MC) 



v/2 

where, in the last step, we used the standard Fourier series representation of the respective 
functions. □ 

Our interest in the use of the Zak-transform approach to cascade approximation 
started in an earlier paper PrJo98|| on the special case of compactly supported scaling 



functions. It is known |[Dau92|| that compact support of ip (as a scaling function in L? (R)) is 
equivalent to the filter function mo ( • ) being a polynomial, i.e., the sum in ( |3.5| ) being finite. 
We encountered there the following two sesquilinear forms, both indexed by T: 



(4.15) p x (fit, h 2 ) {e- iuJ ) =J2h(u + 2nn)h 2 (u + 2vm) , 

where the functions hi, h 2 are in L? (R) and of compact support in the x- variable. This puts 
the Fourier transform hi, i — 1,2, in the Schwartz class so that the sum in (|4.15|) is well 
defined. But problem was how to most efficiently remove the compact-support restriction on 
the functions h^. (Note the individual terms in the sum on the right-hand side in ([4.15 ) are 



not periodic. But the sum serves to "periodize" the function u i— ► (fiih^j (u)), uj G R.) 
The other sesquilinear form was 

(4.16) p 2 {hi, h 2 ) (z) = zk I h i ( x ~ k ) h 2 (x) dx. 

k&z ^ M 

With the compact-support restriction, this is even a finite sum. But introducing the Poisson 
summation formula, or by a direct Fourier series expansion of pi (hi, h 2 ), we note that, with 
z = e~ iw , we have 

(4.17) Pi(h 1 ,h 2 )(z)=p 2 (h 1 ,h 2 )(z). 

When pi (hi, h 2 ) is viewed as a function on T, its fc'th Fourier series coefficient computes out 



directly to be L hi (x — k)h 2 (x) dx, and ( |4.17| ) follows from this. 



The following result makes it clear that the compact-support restriction can be re- 
moved by use of the Zak-transform approach, and as a bonus, we get some a priori estimates 
that are needed later. 

PROPOSITION 4.4. Let hi e L 2 (R) be of compact support. Then the two forms 
Pi and P2 coincide with p 3 , where 



(4.18) p 3 (hi, h 2 ) (z) = (Z z h x | Z z h 2 ) = / Zh 1 (z,x)Zh 2 (z,x) dx. 

Jo 

Proof. In the following calculation, convergence is governed by the norm-isometric property 
of Z, and this also justifies the exchange of summations and integration: 

(Z z h x | Z z h 2 ) = [ Y] ( x + k )y2 zlfl2 ( x + dx 
Jo k i 

= y^y^ z l ~ k / hx(x + k)h 2 (x + I) dx 

k I Jo 

= ^2 zH ^2 hx(x + l- n)h 2 (x + I) dx 
i Jo 

/oo 
hi (x — n)h 2 (x) dx 

= p 2 (h l7 h 2 ) (z) ; 

since we already proved the identity p\ = p 2 (in ( |4.17| )), the proposition follows. □ 

Having established p\ = p 2 = p 3 , we will use p to denote the common form. Since 
jt>3 is defined for all pairs hi in L? (R) , the compact-support restriction involved in the 
formulation of p\ and p 2 has been removed. 



COROLLARY 4.5. Let p be the form on L 2 (R) x L 2 (R) which is defined in flCIg 



and taking values in functions on T. Then in fact p takes values in L 1 (T) ; i.e., the left-hand 
side of ( [4.2 1|) felow zs /im'te if h%,h 2 G L 2 . For restricted pairs hi,h 2 of L 2 (R) -functions, 
p{h\,h 2 ) can also be checked to take values in L 2 (T), and mi/i one more restriction in 
L°° (T), i.e., the left-hand sides of ( |4.2U| ) and ( |4. 19| ) , below, are finite with suitable restric- 
tions on hi and h 2 . The restrictions are those which make the right-hand sides of ( 4.19Q 
and (gT2p) /mite. ITie respective bounds d4T9|) , ( ^20|) , and flOll) are as follows: 

(4.19) lb (^1)^2) || oo < esssup ■ ||^/i 2 || , 

z 

(4.20) Ib(/*i,/i2)l| 2 < lb(^A)lli-INI 2 > 

(4.21) lb(>H,Mli < ||/n|| 2 . ||/i 2 || 2 . 



Proof. We begin with ( [4.21] ) since it is universal. Before the estimate, we may restrict to hi 
of compact support, and then, after the fact, this restriction is removed by completion. 



In the following estimates, we use the Cauchy-Schwarz inequality two times, for the 
respective Hilbert inner products involved: 



lb(/ii,Mli = / \{Z z h 1 \Z z h 2 )\ dfi(z) 

JT 

< / \\Z z hi\\ ■ \\Z z h 2 \\ dfi (z) 

JT 

< ^ ||^ 2 /ii|| 2 dfi(z) ■ J \\Z w h 2 \\ 2 dfi (w)j 

proving ( |4.21|) . 

Formula ( |4. 19| ) for \\p (hi, /i2)|| 00 is trivial to check, but it is not clear which condi- 
tions on the hi s make the right-hand side finite. 

Formula Q4.20Q for \\p (hi, h 2 )\\ 2 is useful later since we will be able to check finiteness 

1 

of the factor \\p (hi, hi)\\^. In fact in many cases, we will end up with p (hi, hi) a constant 
function on T. Checking ( (4.20[ ) goes as follows: Let hi G L 2 (R), and suppose p(h\,hi) G 
L°° (T). Then 

\\p(hi,h 2 )\\ 2 2 = / \(Z z hi \Z z h 2 )\ 2 dfi(z) 

JT 

< / \\Z z hi\\ 2 ■ \\Z z h 2 \\ 2 dfi (z) 



< esssup \p(hi, hi) (z)\ ■ / \\Z z h 2 \\ 2 dfi(z) 

z JT 



= IbCii.Mloo • \\ zh 2\\l 

= lb (huh) II ^ • \\h 2 \\ 2 2 , 
which is exactly fl£2Cp . □ 



We now use the estimates from Corollary [4.5| to examine boundedness properties of 
the representation n of L°° (T) on L 2 (R) which was introduced in ( |3.11| ), i.e., ir (a) h = a*h, 
a G L°° (T), h G L 2 (R). When using L 2 (IR) ~ Hz (via the Zak transform), a unitarily 
equivalent form of the representation is (also denoted tt) the following: 

(4.22) 7r(a)H(z,x) = a(z)H(z,x), for a G L°° (T) , H G Hz- 

The idea from the proof of Corollary ^4.5| yields imediately 



\\n(a)H\\ Hz < WaW^ ■ \\H\\ Hz , 

which is the bound which is required for a representation. In studying scaling vectors, 
however, we shall also need, for fixed H G Hz, boundedness properties of the map 

(4.23) C H :u\ — >n(a)H 

from L 2 (T) to Hz- 



PROPOSITION 4.6. The mapping Ch- ol \— > n (a) H in (|4.23 ) is bounded from 



L 2 (T) to Tiz if and only if ' p 2 (H) = p(H,H) is in L°° (T), and then the norm of Ch is 



\\P2 (-H)\\%o- Moreover, Ch has a bounded inverse if and only ifp2 (H) has an L°° (T) inverse, 
i.e., there is some e G R+ such that p 2 (H) (z) >e a.e. on T. 



Proof. We compute 



Ch (a) 




\a(z) H (z, x)\ 2 dx dfi (z) 



T JO 



|at(^)| / \H(z,x)\ dxdfi(z) 
Jo 

\a {z)\ 2 p 2 (H) (z) d^i (z) . 



liaeL 00 (T), then 



\C H {a) 



< a 



\a\ 



p 2 (H) (z) du.(z) 



\H\ 



Hz ' 



and the assertion follows from a standard fact on multiplication operators. The same argu- 
ment also yields the condition for invertibility of Ch- □ 

The significance of the operator Ch, H G Tiz, is that if H = F is a scaling function, 
M (F) = F, where M = ZMZ~ l . Then Cp intertwines U with a special isometry in 
L 2 (T) w £ 2 (Z). Let mo be a filter satisfying in the Introduction, and let M be the 

corresponding cascade operator. Let F be a scaling function. Let mi (z) := ztuq (—z) (which 
is a high-pass filter), and define (Sjf) (z) = rrij (z) f (z 2 ), j = 0, 1, / G L? (T), z G T. Then 
it is easy to check that 



and 



(T) • 



i=0 



We say that the S^s form a representation of the C*-algebra (called 2 ) on the relations. (It 



was introduced in ||Dix64j| ; see also ||Cun77|| .) Such representations were studied extensively 
in recent papers ||Br Jo97|l . A main result from [|BrJo97|1 is that 



Pi s-l 2 (t) = {o} 



ra=l 



An isometry So with this property is called a shift; see, e.g., ||SzFo70|| . Let C := ker (Sq) 
S^L 2 (T). Then the shift property may be restated as 

oo 
n=0 

and moreover the spaces Sq C q are pairwise mutually orthogonal. 



Table 1 . Summary of Theorem [4.7| : Embedding of the isometric model into L 2 (R) 

{0} < — ••• < — V 2 (F) V X (F) V (F) finer scales 

\ \ \ 





W 2 {F) 


Wx(F) 


W (F) 


rest of L 2 (R) 


*w *w *w 

c F 

|g j < _ _ _ Sp $0 Sp 


L 2 (R) « 

J Cf '■ ot I— > 7r (a) F 
L 2 (T) 




OqJ~,q 


SqCq 


C = S\L 2 (T) 



/ / / 

S 2 L 2 (T) S L 2 (T) L 2 (T) 



THEOREM 4.7. Let t/ie setting be as above, and consider the cascade problem in 
Tiz — L 2 (R) . Then the following two conditions are equivalent: 

(i) M (F) = F, and 

(ii) Cf intertwines So and U, i.e., CfSq = UCf- 

Let V (F) = [{tt {a)F\ae L 2 (T)}] where [■} is norm-closure. Let V n (F) := U n (V (F)). 
If (D holds, then K+i (F) C V n (F), and C F (S£L 2 (T)) = V n (F). Let W n (F) := V n (F) Q 
V n+1 (F). If further p 2 (F) = 1, then C F (S%£ ) = W n (F). 

The results of the theorem may be summarized as in Table |1] above. 

Proof. (|i|) =>- (0). Suppose U~ 1 n (m ) F = F; then 

UC F (a) = Utt (a) F = tt (a (z 2 )) UF = n(a (z 2 )) tt (m ) F 

= tt (mo (z) a (z 2 )) F = tt (S Q a) F = C F S (a) for all a e L 2 (T) , 

which is (P). 

@ (@)- Suppose (|iD holds. Then the previous calculation reverses, and shows 
that M(F) = F where M = U^tt (m ), and we have (§). 
Suppose (0), and let a E L 2 (T); then 

C F S n a = ?7 n C F (a) = U n ir (a) F e V n (F) , 

and we conclude that C F (SqL 2 (T)) = V n (F) as claimed. 

Now let a,/3 G L 2 (T), and consider the following calculation: 

(C F S ia | Utt (13) F) = (C F Sia | C F S a) 

= j m l (z)m (z) a (z 2 )(3 (z 2 ) p 2 (F) (z) dfi (z) . 



Table 2. Approximation properties of m Q n ^ and D n [z 



(n) 


D n (z) = 


( n ) / \ 


2 


m { n) = S% (11) 


D n = R* n (1) 






n^oo 


fD n fd/M^f(l) V/eC(T) 
if {/| i?/ = /} is one-dimensional 




|MePa93| 






/ T m { n) dfi = (a ) n 
if m (z) = J2kLo a k zk 


/ T D ™ (*) ^ (*) = 


1 Vn = 


= 1,2,... 



If P2 (F) = 1, then this integral is 

- mi (u>)m (itf) a (z)(3 (z) dfj, (z) . 



But the choice of ni\ (z) = zm (—z) makes it zero: 

]- ^2 m i ( w ) m o ( w ) = \ 101710 (~ w ^ m ° 



0, 



since E M a=* w = o. 

We have proved that if p 2 (F) = 1, C F maps £ = Si (£ 2 (T)) onto W (F) = 
Vo (F) UVq (F), and an iteration of the same argument (induction) yields 

C F (S^S.L 2 (T)) = W n (F) 

as claimed. 

We also saw that Cp is isometric if and only if p 2 (F) = 1. □ 

The significance of the spaces Vo (F) and Wo (F) in wavelet theory is that, in the 
L 2 (R)-picture, V (F), F = Zip, is generated by the father function, while W\ (F) is gener- 
ated by the mother function. It is interesting to summarize the approximation properties of 

the two function sequences m, 



(n) 



m [z) m [z- 



■m \z 



and D„ 



m. 



In Table |2] we include results from |Br Jo9711 in the left-hand column, and results from Meyer 
and Paiva |MePa93fl in the right-hand column. The filter m is given as usual, and (0)-(pii|) 
in the Introduction are assumed. 

We now show that when tuq is given, and M is the corresponding cascade operator, 
then the solutions M (F) = F may be identified with a space of intertwining operators. We 
shall state the details in the Hilbert space Tiz of the Zak transform. 

COROLLARY 4.8. An operator C : L 2 (T) -> H z is of the form C F for some F e 
Hz satisfying M (F) = F if and only if it satisfies the following two intertwining properties: 



(a) CSo = UC, and 

(b) C (fa) = (tt (/) C) (a), for all f G L°° (T) and a G L 2 (T). 

// multiplication by L°° (T) on L 2 (T) is written r (f) a = fa, then (|]) reads: 
(§) Ct (f) = ix (f) C. 

Proof. By Theorem [4.71 , it is enough to check that every operator C : L 2 (T) — > riz which 
satisfies ~ (U) must be of the form C = Cp for some F G riz- So let C be given, 
and assume ©. Let C (1) := F. Then, for a G L°° (T), we have C (a) = Cr (a) 1 = 
n (a) CI = 7r (a) F = Cp (a). We are considering only the case when C is bounded. Since 
L°° (T) is dense in L? (T), the result follows. If (§) also holds, we saw in Theorem |4.7| that 



then M (F) = F, and the proof is completed. □ 

REMARK 4.9. An inner product on the intertwining operators. From @, 
we also get the identities 

C* H C H = r(p 2 (H)) and C* H C H , = r (p (H, H')) 

for H, H' G Tizi where 

p 2 (H) (z) = p (H, H) (z) = I \H (z, x)\ 2 dx, 

Jo 

and r (/) denotes multiplication by / on L 2 (T). Hence (as noted), ||C#|| = \\p 2 (H)^, and 
Ch is bounded if and only if p 2 (H) G L°° (T) . 

REMARK 4.10. Proposition [4.6| will be used in Sections [7|-§ in the study of scaling 
functions if G L 2 (R), i.e., solutions to M<£> = p where M is a cascade operator for some 
given filter m . If Z9? = F, then the scaling equation is equivalent to M (F) = F. Then, 
generically, Cf will be bounded, but will not have a bounded inverse. 

If mo (z) = ^ (1 + -2 3 ), then tp (x) = |X[o,3] an d for < x < 1, 

F (Z, X) = Zip ( Z , = £ (X[0,3] i X ) + ^X[-l,2] 0*0 + z2 X[-2,i] {x)) ■ 

It is easy then to check that 

p 2 (H) (z) = p (H, H) (z) = I (z- 2 + 2Z- 1 + 3 + 2z + z 2 ) 

= i(3 + 4cosu; + 2cos (2a;)), 
9 

where z = e~ luJ . In this case, Cf is not invertible as p 2 (H) vanishes on T. In fact, 
p 2 (H) (u) = | (2 cosc<j + l) 2 which, of course, vanishes for uj = i 2 ^. 

REMARK 4.11. Using the usual isomorphism L? (T) ~ f (Z) defined by the Plan- 
cherel theorem for Fourier series, we note that, if m (z) = Xlnez ™" 2 ™ > then the operator 

/ ^ m ( 2 ) / (^) ; onL 2 (T), 



takes the form 

(4-24) (S£) n = J>«-^' 



k 

k&Z 

2 



when realized as an operator on the sequence space £ (Z) via the Fourier series representation 

f( Z )=J2HnZ n , (£„)G£ 2 , J2^\ 2 = " 1,112 



2 ' 



and this is the connection to the Micchelli operator (|2.4|) mentioned in the Introduction. We 
sketch the details of this argument below, and refer to |[Mic96|| for more details. 



Starting with 27r-periodic functions m and /, corresponding to the Fourier repre- 
sentation 

m (z) ~ m (u) ~ a n z n , 



and 



with z = e w G I, as the T = IR/27rZ convention, we have Micchelli's operator S of 
( |4.24| ) or ( f2.4| ) in the function form 

(w) = m (u) f (2a;) , wGl. 

For each /c, the iteration S k f is also 27r-periodic, while (^S k f^j has period 2 fc • (27r). 

Introducing (Z7F) (x) = 2"F (§), £ G IR, on functions, or distributions, on IR, we arrive at 
the representation 

(4.25) U k S k f ~ 2-f m ( w ) m (|) • • • m Q£) / («) . 

If there is a limit function (or distribution) F^, as k — > oo, then the difference 

tends to zero in the limit A; — ► oo. Note that A fc , for each k, is acting on sequences, say £ 2 (Z). 
Hence we get the corresponding scaling function from Q4.25 ) at the dyadic rational points 



| j, k G Z} cl this way, and we have therefore made the connection to the Micchelli 



approximation of [ Mic96 1 ; see also (2.4) in the Introduction above. 

5. PROOF OF THEOREM |Q| 

Recall that Z is an isometric isomorphism, viz.: 

L 2 (R) ~ H z ~ L 2 (T x [0, 1]) , 

where the second ~ amounts to restriction from IR to [0, 1] in the x-variable: if H G Tiz 
satisfies ( |4.2j ), then the restriction H (z, x), < x < 1, defines the corresponding element 
in L 2 (T x [0, 1]), and a simple argument shows that this restriction mapping is indeed an 



Table 3. Operator correspondence between L 2 (R) and Hz (Lemma |5.1| ) 



heL 2 



He H z 



(T n h) (x) = h(x + n) 



7r (a) h = a * h 



M = U^tt (mo) 



M* = % (mo) 17 



(E^) (a) = e ite /i (x) 



(Fh) (x) = f K e- i2 ™vh(y) dy 



f n H(z,x) = z~ n H(z,x) 
7i (a)H (z, x) = a (z) H (z, x) 
MH(z,x) = 

75 m (w)H(w,2x) 

M*H(z,x) = 

^Mz)(H(z 2 4)+zH (z 2 ,^)) 
E t H(z,x) = e itx H (ze a ,x) 



FH) {e i2 ™,x) = 

e -i2nxu H ( e ~i2nx^ ^ for ^ x £ 



isomorphic isometry of Tiz onto L 2 (T x [0, 1]). It follows that operators in one space identify 
with corresponding operators in the other. If A is a given operator in L 2 (R), then A := ZAZ* 
is the corresponding operator in Hz- 

The proof of the following lemma is essentially contained in the previous section: 
see especially (|4.11|) and Lemma [4.3| . 

LEMMA 5.1. If A is one of the operators in L 2 (R) listed in the first column of 
Table |3|, then A = ZAZ* in Hz is given by the corresponding entry in the second column of 
Table H. 



Proof. In the previous section, we also elaborated on the operators n(ct), a G L°° (T), 
Uh (x) = -h^h (|), and the cascade operator 



Mh (x) = V2^a n h (2x - 



n 



for m (z) = Ylnez a n zn j representing the given low-pass filter. The present proof amounts 



to a combination of the calculations leading up to Lemma [4.3| , and the argument from the 
proof of that lemma. □ 



Proof of Theorem With the aid of Lemma |5.1| , the proof of the two commutation rela- 
tions in Theorem [TT] now amounts to the following computations. They take place 



in the space TCz, i-e., the range of the Zak transform, so it is the right-hand column in Table 
^| which is used. 

Ad (§: Let a G L°° (T) and H G H Z - Then 

= - m (z) a (z 2 ) m (w) (H (w, x) + zH (w, x + 1)) 

= - m (2) a (z 2 ) m (w) (H (w, x) + zw~ l H (w, x)) 

22 

= - m (z) a (z 2 ) m (w) (l + zw^ 1 ) H (w, x) . 



m2— w2 



But the summation is over w G {±^}, and the term 1 + 210 1 = 2 if w = z, and 1 + zw 1 = 
if w = —z. We get 

M*^{a)M) H (z, x) = I mo (*) | 2 a (V) # (z, x) , 



which is the identity (|a|) of Theorem |3.1| . 

Ad (|): As in (|), let a G L°° (T) and H e H z he given. Then 

(m^c^M*") # (2, a;) = -^= ^2 m o H a H [m*H\ (w, 2x) 

= i? (a) (2) # (2;, z) + R (eta) (z) H (z, x + , 

which is precisely the second identity @ of Theorem p.l| . Note that we obtain the identities 
in Tiz, but since Z is an isomorphism, Z: L 2 (R) — > 7^^, we automatically get the same 

identities in L 2 (R) where the translation to L 2 (R) is made via the dictionary of Lemma |5.1| 
(Table D. □ 

REMARK 5.2. The last line in the dictionary of Lemma [5.1| (Table is the corre- 
spondence for the Fourier transform T in L 2 (R), and it shows that the equivalent transform 
JF in Tiz is given by a very simple formula: it has a phase factor, and otherwise only involves 
switching of the two variables x, u), i.e., time and frequency variables, z = e l2wiJJ . It was 
included in Table |] for later use. 

6. SUB-ISOMETRIES 

Let m Q G L°° (T) be a low-pass filter, i.e., satisfying conditions ( |3.1|) and (|) in Sec- 
tion H], and let R, M he the corresponding Ruelle operator and cascade refinement operator; 
see (|2.1| ) and Q3.8Q for details. Then L°° (T) is represented as an algebra of operators on 



L 2 (R) via 7T (a) h = a * h, a G L°° (T), h G L 2 (R), c/. (|3~T2l) or ( pg ). Then we get the 
following two properties for M: 

(6.1) M*7r (a) M = 7r (R* (a)) 
and 

(6.2) M*vr(a) = tt (a (z 2 )) M*, 

for all a e L°° (T). Taking a = II in (|6J]) , we get M*M = ?r (|m | 2 ). So M is not an 



isometry unless |mo| 2 = 11. The last condition is inconsistent with the low-pass property (|) 
of m , i.e., m = y/2 at z = 1. But we say that M is a sub-isometry. More generally, let 7r 
be a representation on a Hilbert space 7i. 

DEFINITION 6.1. Let R be the Ruelle operator introduced above, i.e., R: 7i — ► 
L 2 (R), and let 7r be a representation of L°° (T) in the algebra of operators on Ti, such that 
identities (§), (0) hold for all a G L°° (T). 

We say that an operator M on TC is an (R,n) -isometry, or a sub-isometry if the 
data (i?, 7r) is understood, if 

(i) M*n (a) M = TT (if (a)) on ft, and 

(ii) 7T (a) M = Mvr (a (z 2 )) on H. 

(In the general case, the two conditions (@) an d ®j are independent. A discussion of © and 
its variant (16. 21) will follow.) 



If M is an isometry, i.e., M*M = id^, then M (TC) is closed, but it may not be 
so if M is only a sub-isometry. Then we shall denote the closure [M7{\. Its orthogonal 
complement is kerM* = {h G H \ M*h = 0}. Let £ := kerM*, and set 

oo 

(6.3) ft (oo) := p| [M n H] . 

n=l 

Again, if M is an isometry, the classical Wold decomposition of H relative to M states the 
orthogonal decomposition 



(6.4) n = J] e M n c © n 



(oo). 



n=0 



sec 



SzFo70|| for details. The dimension of C is then also a complete invariant for the isometry 
in the pure case, i.e., when 7v-°°^ = {0}. 

In this section, we prove an analogue of this result for general (R, 7r)-isometries. In 
that form, the components corresponding to M n C in ( |6.4|) will instead be [M n £]. We will 
still have orthogonality of the subspaces in the decomposition. The important new element 
for (R, 7r)-isometries is that each of the subspaces in the decomposition is invariant for the 
representation ir, i.e., n (a) maps [M n £] into itself for all a G L°° (T), and n = 0, 1, ... . 
Similarly 7-^°°) is invariant under the operators it (a). The analogy to the classical Wold 
theorem for isometries raises the question of whether some invariant of the representation 
n, when restricted to C = ker (M*), is perhaps a complete invariant in the case when M is 



an (R, 7r)-isometry. This question is answered (at least partially) by Theorem p7^(|q) below, 
while the first two parts of the theorem give a direct analogue of the Wold theorem itself in 
this new representation-theoretic framework. 

THEOREM 6.2. Let m G L°° (T) be a given low-pass filter satisfying the quadratic 

equation 

(6.5) \m (z)\ 2 + \m (—z)\ 2 = 2 a.e. onT, 

and let R be the corresponding Ruelle operator. Let n be a representation of L°° (T) on 
a Hilbert space TC, and let M be an associated (R,Tr)-isometry, i.e., satisfying conditions 
(D~(0) ^ n Definition |6~T] . 

Then TC has an orthogonal decomposition: 

oo 

(6.6) n= ^ [M n c]®n {co \ 

n=0 

where £ = ker (M*), and Ti^ = f]^Li [M n TC]. It has the following three properties: 

(a) the individual closed subspaces in the decomposition are mutually orthogonal, i.e., [M n £] 
is orthogonal to [M fc £] ifn=£k, and they are all orthogonal to HS°°} ; 

(b) each of the spaces [M n C], for n = 0, 1, ... , and Ti.^ is invariant under tt (a) for all 
a G L°° (T); and 

(c) every representation txq of L°° (T) in a Hilbert space £ arises as the n = term of (|6.6|) 
for some (R, ir)-isometry M. 

Proof. In the proof, we shall refer to the two properties (|)-(0) in Definition |6.1| . If S C Ti 
is a linear subspace, the orthogonal complement will be denoted 

S ± =HQS = {heH \ (h,s) = 0, Vs G 5} . 

We clearly have £ = {MH) L . 

Claim 1. £ is invariant under tt (a), a G L°° (T). 

Proof. Let / G £. To show that n (a) / G £, we check that 

M*tt (a) I = tt (a (z 2 )) M*l = 0. 

We used dn|) in the calculation, noting that M*l = by definition. □ 

Our next assertion is this. 

Claim 2. £® [M£] = (A'Puf . 

Proof. We prove the claim by showing that a vector ho which is orthogonal to all three 
subspaces £, M£, and M 2 TC, must be zero. The three subspaces are pairwise mutually 
orthogonal. This is immediate from the definitions except for the last pair. Let / G £ and 



heH. Then 



(Ml\M 2 h) = (M* 2 Ml\h) 
= (M*M*Ml | h) 
= (M*n(\m \ 2 ) l\h) 

= ^tt (Jm (2 2 )| 2 ) M*l 
= 0, 



h 



where we used properties (§) and (|i|), in that order. In the last step, we used M*l = 0. 

The condition that ho is orthogonal to all three subspaces amounts to: ho G [MH], 
M*h G [MH], and M* 2 h = 0. Since H = C® [MH], we have [MTi] C [MC] © [M 2 H], so 
ho = u + v, u e [MC], v e \M 2 H\. Since M*h Q G [M7i], we get ho G (MC) ± . Indeed there 
is a sequence hi Eli. such that M*h = linij Mh^. So if I G C, then 

(MZ 1 h ) = (I | M* h Q ) = lim (/ 1 Mh { ) = lim (M*Z | hi) = 0, 

i i 

since / G ker (M*). Hence it = in the decomposition of /t : h — + v, v E [M 2 H]. But 
/i G ker (M* 2 ) = (M 2 H) L } and we conclude that /t = 0. □ 



Claim 3. [M£] is invariant under 7r (a), a G L°° (T). 

Proo/. We first show that, if l G C and a e L°° (T), then tt (a) Ml e C ® [MC]. Using 
Claim ||, we do this by showing that 7r (a) M7 G ker (M* 2 ). But 

M*tt (a) M/ = vr (iT (a)) Z G C, 

where we used first (^) and then Claim [I]. Then 

M* 2 7r(a) Ml = n (R* (a) (z 2 )) M*l = 0, 

where we could use (0) or, alternatively, Claim |l|. 

This means that n (a) \c®\mc] has an operator block matrix relative to the orthog- 
onal decomposition C © [MC] of the form 

*(a)~(o d)' 

with B: [MC] — > £, and the diagonal operators A and D being endomorphisms of the 
respective spaces C and [MX]. Since % (a)* = 7r(a), the adjoint (^» ) must be of the 
same form, and that forces B = 0, i.e., n (a) = (^), and each of the spaces C and [MC] 
is then invariant under ir(a). In particular, [MX] is invariant, which is the claim. □ 

Our next assertion merges Claims |2| and [3] into the following induction: 

Claim 4. For each n = 1, 2, . . . , we have the decomposition 

C © [MC] © ■ • ■ © [M n - x C] = (M n H) ± , 



where the terms in the decomposition are mutually pairwise orthogonal, and further each of 
the spaces [M fc £] is invariant under n (a), a G L°° (T), k = 0, 1, . . . , n — 1, where we set 
[M°C] = C. 

Proof. This is a simple induction which is based on Claims [l|-|3], and it is left to the reader. 
Alternatively, we can prove it by using the earlier claims on the operators M 2 , M 3 , . . . . □ 

Claim 5. We have the decomposition ( |6.6|) of the theorem with the two properties 

(!)-(§■ 

Proof. For each n, let Q n denote the projection onto [M n TL\. Since [M n+1 H] C [M n H], this 
is a decreasing sequence of projections in TC, By Hilbert space theory (see, e.g., |[SzFo70|| ), 
it has a limit Qoo, i.e., lim^oo \\Q n h — Qooh\\ = for all h G Ti., and Qoo is the orthogonal 
projection onto TiS 00 ' = H^Li QnH- Recall that Q n li. = [M n Ti], by definition! In fact, 

1^00^11= inf \\Q n h\\, heH. 

n 

But Claim g states that / - Q n is the projection onto C © [ML] © • • • © [M n-1 £], where we 
write / for the identity operator in Ti. Now / — Q n is an increasing family of projections, 

oo 

and its limit / — Qoo is the projection onto [M n C]. Let B denote this space. Vectors 

n=0 

b in B are characterized by (/ — Qoo) b = b, or equivalently Qoob = 0, and each b has the 

unique representation b = b n , \\b\\ 2 = J^'o' IIMI 2 ' ^« ^ [M n £], n = 0, 1, 

Since each of the spaces [M n £] is invariant under n (a), a G L°° (T), by Claims [|-[|, 
it follows that B is also n (a) -invariant. Since 7r (a)* = n (a), it follows that 

oo 

(6.7) B L = H {oo) = p| [M n U] 

n=l 

is also 7r (a) -invariant. □ 

The proof of Theorem |6.2| (|c|) will be given after the next three corollaries. 

COROLLARY 6.3. For each n = 1,2,..., the space [M n H] is invariant under 
ti (a), ae L°° (T). 

Proof. We showed in Claim that 



n-l 



(6.8) (M n H) ± = [ MkjC ] ' 

k=0 

and that the right-hand side has the tt (a)-invariance. Since % (a)* = n(a), we conclude 
that (M n H) ±± = \M n U\ is also vr (L°° (T)) -invariant. □ 



Let (i?, 7r,M, 7i) be as in the statement of Theorem |6.2| , i.e., M: H — > 7i is an 
(.R, 7r)-isometry relative to some Ruelle operator R and representation it. We say that a 
closed subspace 5 C H is double invariant if 5 is invariant under both M and M*. It is 
then immediate from the definition of M* that S is double invariant if and only if both S 
and S 1 - (—HQ S) are invariant under M, i.e., M (S) C S, and M C S ± . 



COROLLARY 6.4. Let {R,ir, M, H) be as described, and let B (= ^® [M n C]) and 
n (oo) ( = [ M n n ^ be as m Theorem g]| o 

Then both B and are double invariant under M . 

Proof. From the comment before the statement of the Corollary, it is enough to show that 
both B and H {oo) are invariant under M. Recall from Theorem |6.2| that B = (Ti.^ 00 ^ , 
and H (oo) = B L . But it is clear from the definition of H {oo) that M (H {oo) ) C H {co) . Since 
arbitrary vectors b in B may be represented as b = M n l n , l n G C, \\b\\ 2 = ||M n / n || 2 , 
it follows that Mb = M n+1 l n . If b is encoded with the sequence (Z , h,h,---), l n £ C, 
then M6~ (0, l , h, . . . ), i.e., 



(6.9) 



M((l J 1 ,l 2 ,...)) = (0J ,l 1 ,...) 



and the M-invariance for B follows from this. By the initial argument we conclude that both 
B and W(°°> are double invariant. □ 



The advantage of the (l , Z l5 Z 2 , 
cially simple form: 



representation of B is that M*\g takes an espe- 



COROLLARY 6.5. If vectors in B are represented in the form (lo,h,h, ■ ■ ■), 
]T2" ||/ n || 2 < oo, l n E C = ker (M*), then the action of M* on B is 

(6.10) (l ,k,k, • • •) 1 — ► (jr (\m \ 2 ) h,ir(\mo {z 2 )\ 2 ^j l 2 ,n (\m (z 4 )\ 2 ^ Z 3 , • • •) • 

Proof. The proof follows from the following calculation: M*Iq = 0, and M*M n l n = 
n (|m | 2 ) M n - X l n = M n - X n 



m z 



I,, . 



□ 



Proof of Theorem 6.2| (|c|) . Let the representation tvq of L°° (T) in £ be given as in Theorem 
3(H). Let m , h be as stated at the outset, i.e., R mo (h) = h. On the vectors described in 
j|), define the Hilbert-space norm, and corresponding completion, by 



oo 



n=0 



and define M as in ( |6.9|) . A simple computation, using the corresponding inner product 



oo 

<(oko> ^=E(^o(4 n) 



n=0 



, (n) 
^0 m 



and Table || then yields an adjoint operator M* which turns out to be ( |6.10|) . It is now a 
simple matter to verify that M is the desired sub-isometry. □ 



REMARK 6.6. It follows from Corollary |6.3| that each projection Q n (onto the space 
[M n Ti]) commutes with tt (a), a G L°° (T), i.e., Q n ii (a) = % (a) Q n , but it is generally not 
the case that M commutes with n(a). However, M may possibly commute with a special 
7r (aio) for some ao G L°° (T). We have the following simple result on that. 



PROPOSITION 6.7. Let uiq e L°° (T) be as specified in Theorem |6.2| , and ?ef n 
be a faithful representation of L°° (T) on a Hilbert space 7i. Let R be the Ruelle operator 
constructed from m , and let M be a given sub-isometry. Let a L°° (T). If Mix (a ) = 
it (ato) M, then it follows that R(ao) = olq, i.e., «o is an eigenvector for R with eigenvalue 
1. 

Proof. By (|) of Definition 

7T (|m | 2 a ) = M*Mtt (a ) = M*ir (a ) M = n (R* (a„)) • 

In the first step, (|) is used on 1, then commutativity is used, and in the last step, (0) is used 
on a . Since it is assumed faithful, 

(6.11) \m \ 2 a = R* (a ) = \m {z)\ 2 a (z 2 ) , 

and 

R (a ) (*) = \ ^2 l m ° ( w ) I 2 a ° ( w ) 



„2- 



- ^ |m H| 2 a (^) 



2 

= «o (z) , 

where the quadratic property of m was used in the last step. Hence R (a ) = a o as 
claimed. □ 



REMARK 6.8. The converse implication to the one given in Proposition |6.71 is not 

true. 

Let m (z) = A= (1 + z 3 ) (see also Section [|), and let R and M be the corresponding 
Ruelle operator and cascade operator. The scaling function tp realized in L 2 (M.) is (p — |xr 3 ,, 
and a little calculation (see Section £|) shows that 

p 2 (<p) =p(<p,<p) = l [z~ 2 + 2Z" 1 + 3 + 2z + z 2 ), 
y 

where z = e~ tuJ , to G R. But R(p 2 (<p)) = p 2 (<p), so this is an eigenfunction for i£. Let 
<2q := £>2 {(f)- We claim that 7r (ao) does not commute with M. In fact, commutativity with 
M is equivalent to identity ( |6.11| ) from the proof of Proposition p.7\ An inspection shows 
that flS.lip is not satisfied in this example. In other words, 

(6.12) \m (z)\ 2 a (z) ^ \m (z)\ 2 a (z 2 ) . 



Both sides in ( 6.12Q are polynomials, i.e., in C [z, z 1 ]. The right-hand side contains a term 



z 7 whereas the left-hand side does not. 



REMARK 6.9. The eigenvalue problem for R plays a crucial role for approximation 
of wavelets; see, e.g., gtr9| , ||CoDa96|| , |Vflg3[ . 



In the study of refinement operators, the sub-isometries usually have a slightly 
different formulation, and for the particular cascade operator M in ( ft.8| ), the alternative 
formulation is stated in Lemma fO] ; see (|4.12|) . 

We now specialize the general formulation (tt, M,H) of the present section. Recall 
that 7r was a representation of L°° (T) on a Hilbert space H, and M : H — > H was an operator 
satisfying (0)-(0) of Definition |D] relative to some given Ruelle operator R = R mo - We will 
specialize as follows: H = Hz (the Hilbert space of Section |), and 

(6.13) (ir z (a)H(z,x)) = a(z)H(z,x), a G L°° (T) , H G Hz- 

In this specialized setup, we then have the following. 

PROPOSITION 6.10. Let the 'pair (ttz,Hz) be as described, and let M be an oper- 
ator in Hz- Then the following two conditions are equivalent. 

(i) For a.e. z in T, we have the identity 

(MHi (z) | MH 2 (z))„ m = R (<#i ( • ) | H 2 ( • )) L2(0jl) ) (z) 

for allE x ,E 2 G H z . 

(ii) M*%z (a) M = tt z (R*a) for all a G L°° (T). 

Here the Ruelle operator is defined from an arbitrary m as usual. 
Proof. © ©. Let Hi,H 2 G Hz- Then 

fa I (M*n z (a) M) H 2 ) Hz = [ {MH X (z) \ MH 2 (z)) a (z) dfi (z) 

= [ R((H 1 (-)\H 2 (-)))a(z) dfx(z) 
Jt 

= [ (H 1 (z)\H 2 (z))\m (z)\ 2 a(z 2 ) dfi(z) 
Jt 

= (H x \ir z (R* (a))H 2 ) Hz , 

and this proves (0). 

© =^ (HD- W (0) holds, the calculation shows that the second and the third terms 
must agree for all a G L 00 (T), and, by duality, this means that @ must hold a.e. on T. □ 

7. SINGULAR CASCADE APPROXIMATIONS 

We return in this section to the cascade operator M from Section [1], but it will 
be convenient to state the results for the Hilbert space Hz of Section [| We will also need 
the representation t\z of ( |3.12|) . From the dictionary in Lemma |5.1| (Table we note that 
Z-translations in L? (R), h(-) t— > h( - +n), n G Z, correspond to vr^ (e n ) in 7^^ where 
e n (z) = z n . Also we need the fact that the operation h i— > a * /i on L 2 (R), h £ L 2 (R), 
a G L°° (T), corresponds to multiplication a (e~ ia; ) /i (cj) where /i (a;) = J K e~ luJx h (x) dx. 
Hence the result when stated in Hz can easily be translated to either L 2 (R) or L 2 (R). 



Let Hi G TCz- We shall need the sesquilinear form p from Section || 

p(H 1 ,H 2 )(z) = / H 1 (z,x)H 2 (z,x) dx = (H 1 (z)\H 2 (z)) L2 , Q1) . 
Jo 

If H = H\ = H 2 , we introduce the abbreviation p 2 (H) = p (H, H). An important property 
of a starting vector h G I/ 2 (R) for the cascade algorithm is orthogonality of the translates 
{h ( • — n) | n G Z}, i.e., 



(7.1) / h(x — n)h(x) dx = 5 n fi 



2 

2 ' 

The following is immediate from Proposition (Section^): 

LEMMA 7.1. Let h G L 2 (R), and set = Z/i. TTte following are equivalent: 

(i) /i satisfies the orthogonality condition (|7TT|); and 

(ii) p 2 (#) (2) = \\H\\h z a - e - on T - 

We shall refer to this as the orthogonality condition, meaning orthogonal 'L-translates in 
L 2 



REMARK 7.2. We note similarly that when the Strang-Fix condition 

(7.2) J2h(x + n) = 1 

makes sense for h G L 2 (R), e.g., if h is of compact support, then this corresponds to the 
following condition on H = Zh: 

(7.3) H(z = l,x) = l a.a. x e [0, 1] . 

It is well known that the cascade approximation must start with the orthogonality 
condition of Lemma |T7T| . Suppose H G Hz satisfies ( \l.3\j , and 

(7.4) H = H B + H OQ 

is the Wold decomposition from ( |6.6|) in Theorem |6.2] , i.e., 

oo 
n=0 

and 

oo 

Hoc G ft (oo) = pi [M n 7Y] 

n=l 

relative to a fixed (R, 7r)-isometry M, where R = R mo i and tuq is given. Then, if H satisfies 
the orthogonality ( [7.1|) , or equivalently ([n]) in Lemma then one of the two, H& or H^, 
can satisfy the same only if the other is zero. This follows from the following formula for the 
norm in H = Hz- 

(7.5) \\H\\h z = f P 2(H)(z)dfx(z). 



Applied to the decomposition H = Hg + Hoo in (|7.4j) , we get 

(7.6) / p 2 (H) (z) dfi (z) = fp2 (H B ) (z) dfi (z) + f p 2 (z) dfi (z) 

Jt Jt Jt 

from which the claim is clear. If H is orthogonal, then 

1 = ||#|| 2 = fp2{H)(z)d^{z), 
Jt 

so if for example p 2 (Hb) = 1; then 

1 = 1+ p 2 (Hoo) (z) dn (z) , 
Jt 

so = 0. 



While Remark |7.2| above shows that the ^-decomposition is an obstruction to cas- 
cade approximation, it is still the case that solutions H to M (H) = H (i.e., H = Zip for 
some scaling function ip 6 L 2 (R)) yield 

PROPOSITION 7.3. Let tuq be a low-pass filter, and let R be the corresponding 
Ruelle operator. Let nz be the representation (|3.12| ) or ( |122| ) of L°° (T) in Hz- 

(a) Let M be a sub-isometry. Then 

(7.7) R(p 2 (H))=p 2 (MH), 
so if M (H) = H , then p = p 2 (H) solves 

(7.8) R(p)=P- 

(b) In general, the Fourier series for p 2 (H) is 

(7.9) p 2 (H) (z) = ^z n (Ti (e n ) H \ H) Hz , 

where e n (z) = z n , and 

\\H\\ 2 Hz = J2\^( e n)H\H}\ 2 . 



Proof. Immediate from Lemma and Proposition |6.10| in the previous section. The calcu- 
lation of the expansion ( |7.9|) follows from checking that the Fourier coefficients of p 2 (H) (z) 
are as stated, i.e., that 

z~ n p 2 (H) (z) d/j, (z) = [p (n z (en) H, H) (z) d/i (z) = (n z (e n ) H \ H) Hz . □ 

Jt 

We show next that A = 1 is the only point on the unit circle which is an eigenvalue 
of R, at least when we restrict to continuous eigenf unctions. 

THEOREM 7.4. Let m be a continuous low-pass filter, and let R be the corre- 
sponding Ruelle operator in L°° (T). Suppose that the eigenspace {(eC (T) | R(£) — £} is 
one- dimensional, i.e., ClL. Then if |A| > 1, and A ^ 1, the eigenvalue problem 

(7.10) R(a) = \a 

has no nonzero solution in C (T) . 



Proof. First note that if mo is assumed continuous, then R maps C (T) into itself. We see 
this by first approximating m with finite sums ^2 k cik zk - For each such finite sum, the 
corresponding R maps C [z, z~ l ] (= the ring of finite Fourier series) into itself. Since the 
norm of R, as a operator in L°° (T), is one, i.e., H-Rjl^ ^ = 1, we conclude that R maps the 
norm closure of C [z, z -1 ] into itself. This norm-closure is C (T) by the Stone-Weierstrass 
theorem. 

Since R leaves C (T) invariant, it dualizes to a map on the measures M (T) = C (T)*, 
and we claim that R* (8\) = 5± where 5i denotes the Dirac point measure at 1. Indeed, let 
(6C (T); then 

m a) = \ E \™ nnn = \ (K(i)i 2 e(i) + im (-i)i 2 e(-i)) = 

w 2 =l 

since m (1) = v2 and m (—1) = 0. (This is the low-pass property.) It follows that a (1) = 
when a is the eigenfunction in (|7.10| ). 

Suppose we did have a solution a to (|7.10|) as stated. Let c G C, and set (3 C = 1 + ca. 



Then f3 c (l) = 1. Pick a scaling function Hq for the cascade operator M corresponding to 
mo, i.e., M = M mo , and M (Ho) = H . Since £i = {£ G C (T) | R^ = £} is one-dimensional, 
and p 2 (H ) G £\ by Lemma fT7[|(|iD, we conclude that p 2 (H ) = CI. Lemma |77T1(pH) then 



implies that C = 1, so p 2 (H ) = 1. We then have 

(7.11) iF (&) (*) = iT (to (#„)) (z) = R n ( P (H , tt (f3 c ) H )) (z) 

= p(H ,M n n z (P c )H )(z), 



using the argument from the proof of Proposition |7.3| . The left-hand side of ( [7.11 ) is U+cA n a, 
which is not convergent when c 7^ 0, A 7^ 1, |A| > 1, and a 7^ in L°° (T). 

On the right-hand side in (|7.11|) , some more analysis is needed. Our next claim is 

that 

lim (M n 7T Z ((3) Ho) (z, x) = /3 (1) H (z, x) 

whenever (3 G C (T), and the limit is in Hz, or in L 2 (R) after a translation of the result via 
the Zak transform. Using Lemma |5.1|, we get 



M n n z ((3) Ho (z,-) = 2-S £ m (w) m (w 2 ) • • ■ m (w 2 "" 1 ) /3 (w) # K 2»s) . 

UI^ = 2 

Let mj,"' (iw) := m m (u> 2 ) • • • m ^u> 2 ™ 1 j . Since M (H ) = H , we get for the difference 
(picking /3 such that (3 (1) = 1) 

Ho - M n 7r z (/?) Hq = 2~t ^ m n) (w) • (1 - /3 (w)) if («;, 2 n x) . 

on 
«A> Z =2 

The Hz- norm is that of L 2 (T x [0, 1]); we split up the integral ^- ■ ■ ■ du> over T into 
two regions, one —5 < uj < 5, and the other the union of the intervals — tx < uj < —5 and 
5 < uj < n. We pick 5 such that |1 — (3 (e~ lul )\ < e when \u\ < S. After estimating the two 



separate contributions, and using Lemma 7.1 and Corollary |4.5|, the desired result follows 



When it is applied to the right-hand side in ( 7.11|) , we get 



p(H ,M n 7r z {p c )H )(z) — > c (l)p 2 {H o )(z) =p 2 (H )(z) 

n— >oo 

in the L 1 (T)-norm. But /3 c (l) = 1. Comparing the two results for the limits of the left- 
and right-hand sides of ( [7.111 ), we arrive at the desired contradiction, and conclude that the 
eigenvalue problem ( [7.1 0|) does not have eigenvectors as stated. □ 

8. SINGULAR VECTORS 

We now return to the subspace B of the Wold decomposition in Theorem |6.2| . But 
we will specialize to the Hilbert space 7i = Hz, although the operators on H z correspond 
to (unitarily equivalent) versions, on L 2 (R) and L 2 (R), via the inverse Zak transform Z* 
and the Fourier transform, respectively. 

The object is to understand when the singular space B is present in the decompo- 
sition Tiz = B © T-C^°°\ This is important, as we showed in the previous section that the 
cascade approximation picks up divergences when B ^ 0. Since 

oo 

(8.1) B = ^®[M n £] , 

n=0 

where £ = ker(M*), we see that the question is decided by the question of when £ ^ 0. 
Hence we need to study M* more closely, and the second relation (|]) from Theorem |6.2| is 
crucial for that. While there is some connection between the results in this section and those 
of |PaLa|| , there are several differences as well: the present approach is general and applies 
equally well to higher dimensions, matrix dilations, general iV-to-1 maps (onto) in metric 
spaces, and, for the wavelets, even to the case when the Hilbert space is different from the 
standard one, i.e., L 2 (R). 

Let h G L 2 (R), and set h(to) = L e~ tuJx h (x) dx. The inverse Fourier transform 
will be denoted 

f V (*) = 7^ f j" X f M diV. 



2tt _ 

If S C L 2 (R) is a closed subspace, we set 

(8.2) 5 V = {/ V |/G5}. 

If E C R is a measurable subset, consider the projection P E given by 

(p s /)H = x B H/H, to ei, 

and define 

(8.3) L 2 (E) := P E (L 2 (R)) . 

We now turn to the closed subspaces in the decomposition (|6.6| ). Considering the 
L 2 (R) model, we show that there are measurable subsets Ek (mo) C R such that 

(8.4) [M k C] = L 2 (E k (m )) v , the L 2 (R) picture, k = 0, 1, ... , 



where ( • ) v denotes the inverse Fourier transform in L 2 (R). In fact it follows from conclusion 
(|b|) of Theorem |672|, i.e., our Wold-type decomposition theorem (Section ||) that the spaces 
[M fc £] C L 2 (R) must have the stated form (|8.4| ), but the object in the present section is to 
find the sets. To see this, recall the formula 

(8.5) (tt (0 hf(io) = £ (e-*") h (u) , £ G L°° (T) , h G L 2 (R) , G R, 

for the L°° (T)-representation 7r. Using Theorem |6.2| a second time, we conclude that there 
is also a measurable E^ (m ) C R such that 7i(°°) = (L 2 (i?^ (m ))) v . Using finally the 
orthogonality part of the conclusion in Theorem |6.2|, we note that 



(8.6) |J E k (mo) U (m ) = R, 

which is to say that they form a tiling of R. Since the spaces in the decomposition are mu- 
tually orthogonal, the sets E k (mo), k — 0, 1, . . . , k — oo, must be pairwise non-overlapping 
up to measure zero in R, i.e., 

E k (m ) n Ei (m Q ) 

has Lebesgue measure zero when k ^ I. 

When the measurable subset E C R is given, we use the notation 2E for 

(8.7) 2E = {2u | u G E} , 
and similarly, for a G R, set 

(8.8) E + a = {uJ + a\ueE}. 

The mapping u; i— > e~ la; is a measurable bijection of [— 7r, tt) onto T. Let Co be its 
inverse (also measurable). If log denotes the corresponding principal branch of the complex 
logarithm, then 

(8.9) co(z) =«log(*), zGl 

Let mo be a subband filter satisfying conditions (0)-(|i|) in the Introduction, and 

let 

(8.10) N(mo) := {z G T | m {z) = 0} . 
Let M denote the corresponding cascade operator, i.e., 

(8.11) (Mh) (x) = V2^a n h{2x - n) , h G L 2 (R) , 

where m Q (z) = Xlnez a nZ n - Finally, let E (m ) be given by 

(8.12) £(m ) = |J 2c (^(m )) + 47rn. 

LEMMA 8.1. 

ker (M*) = L 2 (m )) v = {/ v |/Gi 2 (m ))} . 



Proof. We have the formula 



(8.13) (M*h) (u) = m (e"*") ■ h (2u) , weM, 

directly from a Fourier transform of Af*. It follows that h {2u) = if z = e~ tuJ GT\JV (mo). 
The lemma now follows from (|8.12 ) and ( 8.13|) . □ 

An immediate consequence of this lemma and ( 8.13Q is the following result: 

PROPOSITION 8.2. We have the equivalence 

ker (AT) = {0} <=> n (N (mo)) = 0, 

where fx denotes the Haar measure on T. 

We shall need the following second consequence of the lemma: 
PROPOSITION 8.3. If he ker (AT), tfien h = on E (m ) + 2tt. 
Proof. We have 

£ (mo) + 2tt = |J 2 • (c (TV (m )) + tt) + 4vm = (J 2 • c (— iV (m )) + 47m. 

But if z G iV (mo), then |mo (— z)| = v2 (from property ([ui|) of mo in the Introduction), so 
—2 e T \ N (mo). Formula ( J8.12 ) then implies that /i (u; + 2-7r) = if u e E (m ), which is 
the desired conclusion. □ 

We now turn to ker (M* fc ), k > 1. We find measurable sets F fc (m ) C R such that 

(8.14) ker(AP fe ) = (L 2 (F k (m ))) V . 

Let m ^ (z) := m (2) m (z 2 ) ■ ■ ■ m [z 2k j. Then 

(8.15) JV (rr$A = 1 2 G T | m [k) (2) = o} 

= N(m ) U {z I z 2 G iV(m )} U • • • U | z 2 ^ G Ar(m )j . 
By Claim [| in the proof of Theorem |6.2| , we have 

(8.16) ker (M* k+1 ) Q ker (M* k ) = [M k C] , 
so we will get the components [Af fc £] from the following lemma. 

LEMMA 8.4. The sets F k (m ) from formula (pl4|) /or ker (M* k ) are 

(8.17) F fc (m ) = |J 2 k ■ c (n (m fc) )) + 2 A 



7m. 



Proof. This is the same argument as the one used in Lemma |8.1| above, and it is based on 



(AT k h) (<jo) = m (k) (e-^) • h (2 k ■ u) , h e L 2 (R) , wel, □ 




Figure 1. <x~ 2 (N (mo)) 



In using formula QS.14 ), the following observation on A" 



z 2 be the square map of T, and let A" C T be a subset. Let 



(*) 



is useful. Let a (z) 



a' 1 (N) = {zeT\ a(z) G N} . 

To understand the dynamical picture of a and its multivalued inverse cr _1 , i.e., their itera- 
tions, we have included a graphical illustration in Figures [TJ and |2|. Then 

N (m n A = N (mo) U a' 1 (N (m )) U ■ • • U a'^'^ (N (m )) 



and 



iV (m n+1) ) \ AT (m n) ) = a"" (N (m )) = {2 G T | / 6 iV (m )} . 
For the example which follows (Example below), we will have 
AT (m ) 

a" 1 (TV (m )) 
<7- 2 (AT(m )) 
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7tt 



the last made up of four arc segments on the unit circle (see Figure |]); and the induction is 
clear for the general case a~ n (N (m )). 
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FIGURE 2. An ergodic map on T 
When Lemma |S.4| is combined with formula ( J8.16 ) , we get 



(8.18) [M k C] = (L 2 (F k+1 (m ) \ F k (m ))) V , 

again with ( • ) v denoting inverse Fourier transform. We now combine the results above into 
a proposition. 

PROPOSITION 8.5. We have 

(8.19) [M k C] = L 2 (E k (m )) v , 



where 



starting with 



E k (m ) = F k+1 (m ) \ F k (m ) 



E (m ) = F a (m Q ) = E (m ) = \J 2c (N (m )) + 4™. 



Proof. Contained in the previous argument. 



□ 



(8.20) 



EXAMPLE 8.6. It is known that the function ip e L? 

sin ttx 



given by 



rex 



is a scaling function. The subband filter mo is 

m (e~ luJ ) = >/2X[_ 5 , f i 0*0 • 



Hence 



and 



c (^(m )) 



7T 

-7T,-- ) U 



7T 



2/ L2' 
£ = ker(ilT) = L 2 (£(m )) v 



7T 



where 
(8.21) 



E (m ) = [J [vr, 3vr) + Ann. 



neZ 



Writing m f~) = m (e l 2), we see that 



'UJ 

m [- + ir 



V2 X 



E(m ) 



and 



m 



(I) - ^ 



G(m ) l^J J 



where G (m ) = LLez [ — 71 ^ 7r ) + ^ 7rri - We have E (mo) U G (m ) = K, and 

G (mo) = E (mo) + 2tt, 
which yields the following formula for the Ruelle operator: 

(8.22) (RO (w) = XG(mo) (w) £ (|) + X S(mo ) (w) £ (| + tt) , £ € (T) , u G 
where we use T ~ IR/27rZ. It is easy to check directly from ( |3.22| ) that 

(R£) {u + 2tt) = (i?£) (w) , for all wet. 
Similarly the scaling identity for if, i.e., 

\f2(p (u) = m (e -< *) <p (^J , 

takes the form 



w g 



which in addition to <£i = Xutttt) has the solution (p 2 = XG(m )> ^ u ^' °^ course 5 XG( mo ) * s no ^ 
in L 2 (R). Nonetheless, the inverse transform, ip 2 = XG(m ) V ' ma kes sense as a distribution. 
The corresponding quadratic expression, p 2 (<p) = p(<p,<p), satisfies 

PzMie-™) =1, 



while 



Since 



P2 



faa) (e"^) = XG(m ) (<" + 27m) = oo. 



m 



^ (w) = 2'X[_ 5if ) (M) X[_ 5>5 ) (M ■ • ■ X[_ 5)f ) ([2 fc -^]) , weK, 

where := c (e _JW ), we see that the higher cases result from an iteration of an ergodic 
map on T as follows: Cq (z) i— > Cq {z 2 ), or w h> 2o;mod2"/T. Starting with 



we get 



and by induction 



c (AT (m )) 



,(2) 



c AM m ' 



7T 



7T 

-7T,- I )U 



7T 



7T 

L4 : 



Co UV < 



vr \ 

-7T, r ) U 

' 2 fc / 



7T 

2fc' 



7T 



(see (|8.9|) and Figure || for the map cq) , and 



where 



starting with 



and 



Fk (m< fc) ) = |J [-2 k TT, -tt) U [tt, 2*tt) + 2 
[M fc £] =L 2 (E fc (m )) v , 



-Efc (%) = F k+ i (mo) \ F k (m Q ) , 
E (mo) = E (mo) = F, (m ) = (J [tt, 3tt) + Attu 



£1 (^o) = F 2 (mo) \ F 1 (m ) = (J [2tt, 6tt) + 8tt?/. 

riGZ 



REMARK 8.7. Example [B.6| is a special case of a classification from ||BrJo97|| of 
filters mo for which |m | takes on only the two values \/2 and 0. However, the present 
analysis is focused on convergence questions which are not addressed there. 

There is one more feature which sets this class of examples apart from those where 
£ = ker(M*) = 0, i.e., those for which the complement of the support of mo in T has 
measure zero (see Proposition |3.2|) . When ker (M*) ^ 0, then 

(8.23) £\ = £ L°° (T) \ R(£) = £} 

may be infinite-dimensional. 

In Example gl| recall H ioo) = L 2 (-tt, tt) v , and 

(8.24) ft (/He-*") =X [ -„, w >M|/M| 2 

= iW> (l/l 2 ) (") > fen^\ ueR. 

Since Mtp = (p, it is clear that (p G H.^, where, in this case, 

oo 

= p) M n (L 2 (R)) = L 2 (R) e {M fe £ | fc = 0, 1, . . . } , 

n=l 

where C := kerM*. Since TiS°°) = L 2 (i? 00 ) v , it follows that [— tt, tt) C -E^, or equivalently 
L 2 (-7T,7r) v C L 2 (^oo) v . But if iteration of ([813D for (M*"fr)~ is combined with the above 
analysis of the example, we conclude that, in fact, the inclusion is equality, i.e., [—tt, tt) = E^, 
up to Lebesgue measure zero in R, and, therefore, TiS°°) = L 2 (—tt, tt) v . The main ingredient 
in this argument is the known ergodicity of the map z i — > z 2 of T; see Figure ||, and |[Kea72 
for details on iV-to-1 endomorphisms of measure space. 

We now consider the cascade approximation lim^oo M"/i in L 2 (R). The object is 
to pick h such that the limit is the scaling function <p. By Theorem |6.2| , the optimal choice 
dictates h E so that excludes (for the example) the standard choice for starting point 



of cascades h — > Mh — > M 2 /i which is /i = xp ^. Clearly A (a;) = Xro 1) 

is not in L 2 (— ir, 7r), so M n X[ ^ — </? 



— > as n — > oo is at best a slow approximation. 

Let h = his + /ioo be the B © 7-^°°) decomposition of Theorem |67|. From Corollary 
33 we have M n h^ G £ and M n h OQ G ft (oo) , so 

M n /i - y? = M n /ioo - <p + MVi^ 
7-^(00) b 



and 
(8.25) 



\M n h 



\M n K 



PROPOSITION 8.8. Let m (e- iuJ ) = V^Xr* jr\(M) as a function onT = R/2%Z, 

[ 2 '2/ 

ane? /e£ i? ; M 6e i/ie respective Ruelle and cascade operators. Let ip = X[- n V anc ^ ^ = Xro 1} ■ 
JTien M(p = <£> and h — hs + h^, where 



and 



hoc H = X[-,- )W > H 



e 2 



sin ( -) 

/is (w) = Xru-*,*) (w) e- l 2 — AM 

2 

w£/i t/ie approximations: 

(i) ||M n /i 00 — y?L — ► 0, and 

(ii) ||M n ft B |L — * °- 

n— >oo 

Proof. Note that, when are combined with (J8.25 ) , we get \\M n h — ip\ 

is a better approximation. 

We first prove (§) by checking that 

(a) ||AP7ioo||2 -* !> and 

(b) (M n h OQ I — > 1, as n — > 00. 

For the first term flal) we have 



.sin(f) 



0, but ffl 



(8.26) 
where 



00 112 



R n { P2 {h 00 )){z) dfi(z) 



P2 {hoo) (e~ iu ) = \ h oo + 27m) 



We take — 7r < < 7r, and recall the formula [uS] 

, s 2 



Hence 



p 2 (ft*) (e~ iW ) = 
Meyer and Paiva 



. Since this function is continuous, it follows from a theorem of 



MePa93|| that 



r R n (p 2 (h^)) (e^) du — p 2 (/i c 

27r y_„ n-00 



,) (w = 0) = 1. 



In view of (|8.26|) , this proves The application of the Meyer-Paiva theorem (see ||MePa93| 
and line 3 in Table ^| above) requires that p 2 (<p) = 1 on T, which clearly holds for the present 



The argument for (|b|) is similar: 



(M n h oc | ip) 



R U (P (/ioo, V 2 )) («) d/J, (z) > p (/loo, V 5 ) = 1) = 1. 



The last step is based on the formula (\cu\ < ir) 



p (/loo, V 9 ) ( e IW ) — ^^oo + 27m)(f (u + 2nn) 



(n = term) = (a;) = e 



sin 



(!) 



It remains to prove (|n|). We have 



\M n h 



B\\ 2 



R n (p 2 {h B )){z) d(i(z), 



and we check that the theorem from [ |MePa93| applies to that term as well: by the above 
argument, we have (for \u\ < it) 



p 2 (h B )(e-^)= 



sin 2 (|) 



nez\{o} (f + nir ) 

By a standard summation formula (see, e.g., ||Car95| , p. 152]), we have 

1 1 1 



£o } (|+H 2 sin 2 (|) 



Hence 



P2 (h B ) (e"*") = 1 



sin 



2 (f) 



(!)' 



which is continuous and equals at u = 0, so (y) also follows from an application of 



MePa93 . 



□ 



In the course of the proof, we established the following result. 



PROPOSITION 8.9. Let <p := Xu-k-k)^ {inverse Fourier transform), and let f G 
L 2 (R) be given with L 2 -Fourier transform f . Then we have the following a.e. identity on T: 

pfaf) (e-^)=X[-^)(M)/H, 
where p is defined on L 2 (R) x L (R) via the Zak transform, 



P (V, /) ( z ) = / (^V 3 ) 2r) (^/) («, a;) rf x 
./o 



[again for a. a. z G T). 



9. APPROXIMATION RESULTS 



In this final section we prove a general approximation theorem which is based on 
some of the same ideas which went into Example |8]6| in the previous section. It uses the 
approximation kernel 

2 



(9.1) 



D n (z) := m (z) m (z 2 ) ■■■m (z 



R* n (H)(z). 



This kernel may be constructed from any filter mo subject to conditions (@)-(fif) in the 
Introduction. If the corresponding scaling function ip with tp (0) = 1 satisfies p 2 (<p) = 1, 
then by a theorem of Meyer and Paiva [|MePa93|| , D n (-) is an approximate Dirac delta 
function 5\ on C (T). In Example |8.6| above, the kernel D n computes out as: 



D n (e-™ ) = 2t X[ _ fj5) (u) X[ _ fjf) ([2a;]) ■ • ([2 n "M) > 

where, for s G R, we set [s] = s — A;27r if A; G Z is such that (2A; — 1)tt<s< (2k + 1) tt, or 



equivalently [s] = Co (e 



s G 



DEFINITION 9.1. Let f G L 2 (T), and c G C. While it is not then possible to assign 
a value to £ (1), i.e., £ evaluated at z = 1, we will say that E£ (1) = c if £ is continuous at 
z — 1 and assumes the value c there. 



THEOREM 9.2. Let mo satisfy conditions (fl)- (|hi|) m £/ie Introduction, and let R, 
M be the corresponding Ruelle and cascade operators, where we view M as a operator in 
Hz — L 2 (R). Let H G Hz be a regular scaling function, i.e., \\H \\ = 1, and M (H ) = H . 
Let F G Hz satisfy p 2 (F) = 1. Then the following two conditions are equivalent: 



(a) \\H -M n F\\ Hz — 0, 



(b) Ep(H ,F)(l) = l, 
where the evaluation at z = 1 in (O) is specified in Definition |9~T 



Proof. Since ||//o|| = lj it follows from Lemma [7.1| that P2 (Ho) — 1, i>e., we have both of 
the families {n (e n ) H \ n G Z} and {-7T (e n ) F | n G Z} orthogonal in 7i^- For L\ we also 
have 

11^= / P 2 (F)(z)^(z) = l. 

Similarly, 



|M"F||^ = / p 2 (M n F)(z) dfi(z) 
R n (p 2 (F))(z) d/i{z) 
R n (1) dn 
1dn = l, 



so, for the norm-difference in @, we have 

(9.2) \\H - M n (F) f Uz = 2 - 2 Re (H \ M n (F)) Hz , 
which means that condition (||) is equivalent to 

(9.3) UmRe(H \M n (F)) Hz = l. 

This last term computes out as follows: first introduce the sequence 

D n (z) := m (z) m (z 2 ) ■ ■ ■ m (^z 2 " 1 j 
(see (p.l|) above), and note that 

R* n {t) = D n (z). 

Since P2 (Ho) = If , we know from a theorem of Meyer and Paiva [|MePa93|| that D n (z) — > 
for z in the complement of any neighborhood in T of z — 1. But the ( ■ | ■ )- H term in 
is 



H | M n F) H = (M n H | M n F) 



Hi 



R n (p(H ,F))(z) dfi(z) 
= [ R* n (-SL)(z)-p(Ho,F)(z) dfi(z) 
= [ D n {z)p{H ,F){z) dfi(z). 

JT 

If p (Hq, F) were continuous, the theorem from |[MePa93[] would simply give 



(9.4) 



D n p(H ,F) dfi F)(l), 



and the equivalence of @ and (^) would be clear from this and (|9.3| )- If (9.4) does not hold, 
we still have p (H , F) e L 2 (T) by Corollary fl5|. But 



D n (zw l ) p (H , F) (w) dn (w) 



is continuous by the conditions on m , and £ n — > p(H ,F) in L 2 (T) by |[MePa93| and 
Corollary |4.5| . Suppose now that (|a|) is given: the argument from above then shows that ([[]) 
holds in the sense of Ep (H , F) (1) = 1. □ 

For the general approximation problem M n h <p, for h, tp e TC, Mtp = <p, we have 
the following simple result. 

PROPOSITION 9.3. Consider a system (M,R,H) as in Theorem §3 with M a 
given sub-isometry and Ti = B © the canonical decomposition. Let h = hs + h^, 

h B E B, hoo ^ be given. 

Then 

(9.5) \\(p-M n h\\ > \\(p-M n h o0 \\ , 

so that h^ always gives a better approximation to tp = Mtp. 



Proof. From Corollary |J, we have M n h B G B and M n h OQ G H {oo) for all n = 0, 1, 2, ... , so 
(9.6) p - M n h = <p- M n h OQ + M n h B _, 



and the result is immediate from this. 



□ 



When Proposition O is applied to the scaling function ip := Xr_ i V from Ex- 
ample |8.6| , we arrive at the following necessary condition on a function h G L 2 (R) for 
||M n /i-(^|| 2 — ► to hold. 

n— >oo 

COROLLARY 9.4. Let ip := Xr ,r] v , and let h G L 2 (R) k pen. Assume i/iai the 
Fourier transform h is continuous on R. T/jen a necessary condition for \\M n h — (p\\ 2 — > 

n— >oo 

is t/iat 

(9.7) |^(27rn) 

n£Z\{0} 



i.e., h = on {2irn \ n G Z \ {0}}. Specifically, if the sum in ([9.7|) is 7^ 0, i/ien t/ie cascade 
algorithm diverges on h. 



Proof. In the discussion of Example 8.6, we showed that 



BII2 = / R n {P2(h B )){z) d(i(z), 

T 



and, as n — > 00, the limit of that term is P2 (h B ) (z = 1). Let z = e iu} , —n < uj < n. Then 
we saw that 

(9.8) p 2 (h B ) (e~ i0J ) = |M^ + 27rn)| 2 . 

n£Z\{0} 

This follows from the identity 

hB H = Xk\[- W)W > M ft H , w G R, 

from Example |8l| Hence 

lim ||M n /i B ||2 = V |/z (2tt0 2 , 

iez\{o} 

and it is clear from (|9.6| ) that M n /i ip in L 2 (R) if this term is nonzero. □ 



REMARK 9.5. A simple calculation shows that the sum Yln^o * n ( |9-7p is nonzero 

if we take h = = ^X[ob) wnen ^ ^ irrational. (The argument can be done with the 

Poisson summation formula.) 

Another example where the sum ( |9.7| ) is nonzero is a Gaussian, e.g., h(x) = 

2 

h (27m) 



1 x ^ 

7r~2e~~. The nonzero sum ^ n 



sum formula; see, e.g., |PyMc72| , p. 140 



can be worked out from Gauss's lattice 



for details. 



We finally note the following application to the cascade operator M in L 2 (R) as- 
sociated with some given filter tuq satisfying di|)-(0) in the Introduction. For f,h G L 2 (R), 
recall the form 



p (f, h) (z) = z n \ f ( x ~ n )h ( x ) dx. 



L 1 (T) 



The results in Section [5] show that 

p: L 2 (R) x L 2 (. 
is well defined and bounded, i.e., 

lb(M)Hi < uj 112 

for all f,hE L 2 (R). 

Returning to the approximation problem 

M n h — >tp, h = h B + h OQ , M<p = <p, 

suppose z i — p(f,h) (z) is continuous when /, h (as a pair) is picked from tp,h B ,hoQ. We 
have the following: 

PROPOSITION 9.6. If tp satisfies p 2 (<p) = p (<p, tp) = I, then 

v{f-Ko,hB) (z = 1) = 0. 

Proof. We have 

0=(<p-M n h oo \M n h B ) LHm 

p(i P -M n h 00 ,M n h B ) (z) dfi(z) 
p(M n ip,M n h B ) dn{z) - [ p (M n h 00 ,M n h B ) dp(z) 



T 

n— >oo 

and the claim follows. 



R n (p(<p,h B ))(z) dfx(z)- / R n (p(h 00 ,h B ))(z) dfi(z) 

Jt 

p(ip,h B ) (1) -p(/ioo,^B) (1) , 



□ 



The last step was based on the idea from the proof of Theorem |9.2| above, and the 
Meyer-Paiva | |MePa93|] result. The latter states that, if M and (p are such that p 2 (<p) = 1, 
then, for all continuous functions £ on T, we have 

lim / (R n 0(z) d/*(z)=£(l). 
An alternative approach to this limit problem is also given in |Kea72|| and ||lrtue9U 
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